Bayesian partial factor regression

نویسندگان

  • P. Richard Hahn
  • Carlos Carvalho
  • Sayan Mukherjee
چکیده

A Bayesian linear regression model is developed that cleanly addresses a long-recognized and fundamental difficulty of factor analytic regression – the response variable could be closely associated with the least important principal component. The model possesses inherent robustness to the choice of the number of factors and provides a natural framework for variable selection of highly correlated predictors in high dimensional problems. In terms of out-of-sample prediction, the model is demonstrated to be competitive with partial least squares, ridge regression, and standard factor models under data regimes for which each of those methods excels; thus representing a promising default regression tool. By incorporating pointmass priors on key parameters this model permits variable selection in the presence of highly correlated predictors, as well as estimation of the sufficient dimension, in the p n setting. 1 The Predictor Distribution’s Role in Linear Regression 1.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Revisiting useful approaches to data-rich macroeconomic forecasting

We compare a number of data-rich prediction methods that are widely used in macroeconomic forecasting with a lesser known alternative: partial least squares (PLS) regression. In this method, linear, orthogonal combinations of a large number of predictor variables are constructed such that the covariance between a target variable and these common components is maximized. We show theoretically th...

متن کامل

A Bayesian Nominal Regression Model with Random Effects for Analysing Tehran Labor Force Survey Data

Large survey data are often accompanied by sampling weights that reflect the inequality probabilities for selecting samples in complex sampling. Sampling weights act as an expansion factor that, by scaling the subjects, turns the sample into a representative of the community. The quasi-maximum likelihood method is one of the approaches for considering sampling weights in the frequentist framewo...

متن کامل

Predictor-dependent shrinkage for linear regression via partial factor modeling

In prediction problems with more predictors than observations, it can sometimes be helpful to use a joint probability model, π(Y,X), rather than a purely conditional model, π(Y | X), where Y is a scalar response variable and X is a vector of predictors. This approach is motivated by the fact that in many situations the marginal predictor distribution π(X) can provide useful information about th...

متن کامل

Bayesian and Iterative Maximum Likelihood Estimation of the Coefficients in Logistic Regression Analysis with Linked Data

This paper considers logistic regression analysis with linked data. It is shown that, in logistic regression analysis with linked data, a finite mixture of Bernoulli distributions can be used for modeling the response variables. We proposed an iterative maximum likelihood estimator for the regression coefficients that takes the matching probabilities into account. Next, the Bayesian counterpart...

متن کامل

5 Approximate Nearest Neighbor Regression in Very High Dimensions

Fast and approximate nearest-neighbor search methods have recently become popular for scaling nonparameteric regression to more complex and high-dimensional applications. As an alternative to fast nearest neighbor search, training data can also be incorporated online into appropriate sufficient statistics and adaptive data structures, such that approximate nearestneighbor predictions can be acc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010